Load-Balancing Spatially Located Computations using Rectangular Partitions

Authors

  • Erik Saule
  • Erdeniz Ö. Bas
  • Ümit V. Çatalyürek
Abstract

Distributing spatially located heterogeneous workloads is an important problem in parallel scientific computing. We investigate the problem of partitioning such workloads (represented as a matrix of non-negative integers) into rectangles, such that the load of the most loaded rectangle (processor) is minimized. Since finding the optimal arbitrary rectangle-based partition is an NP-hard problem, we investigate particular classes of solutions: rectilinear, jagged and hierarchical. We present a new class of solutions called m-way jagged partitions, propose new optimal algorithms for m-way jagged partitions and hierarchical partitions, propose new heuristic algorithms, and provide worst-case performance analyses for some existing and new heuristics. Moreover, the algorithms are tested in simulation on a wide set of instances. Results show that two of the algorithms we introduce lead to a much better load balance than the state-of-the-art algorithms. We also show how to design a two-phase algorithm that achieves different time/quality tradeoffs.
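To make the objective in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It evaluates the load of a rectangle with 2D prefix sums and builds a simple jagged partition: the rows are cut into at most p stripes, then each stripe is cut independently into at most q column intervals. All function names (prefix_sums, rect_load, partition_1d, jagged_partition) and the sample matrix are hypothetical, chosen for illustration only. Each one-dimensional cut is optimal for its own sequence, but the combined two-step scheme is only a heuristic for the two-dimensional problem.

```python
def prefix_sums(load):
    """ps[i][j] holds the sum of load[0:i][0:j] (hypothetical helper)."""
    rows, cols = len(load), len(load[0])
    ps = [[0] * (cols + 1) for _ in range(rows + 1)]
    for i in range(rows):
        for j in range(cols):
            ps[i + 1][j + 1] = (load[i][j] + ps[i][j + 1]
                                + ps[i + 1][j] - ps[i][j])
    return ps


def rect_load(ps, r0, r1, c0, c1):
    """Load of the half-open rectangle rows [r0, r1), columns [c0, c1)."""
    return ps[r1][c1] - ps[r0][c1] - ps[r1][c0] + ps[r0][c0]


def partition_1d(weights, k):
    """Cut non-negative integer weights into at most k contiguous intervals
    minimizing the largest interval sum (binary search on the bottleneck,
    with a greedy feasibility probe). Returns half-open (start, end) pairs."""
    def greedy(bound):
        intervals, start, acc = [], 0, 0
        for i, w in enumerate(weights):
            if acc + w > bound and i > start:
                intervals.append((start, i))
                start, acc = i, 0
            acc += w
        intervals.append((start, len(weights)))
        return intervals

    lo, hi = max(weights), sum(weights)
    while lo < hi:
        mid = (lo + hi) // 2
        if len(greedy(mid)) <= k:
            hi = mid
        else:
            lo = mid + 1
    return greedy(lo)


def jagged_partition(load, p, q):
    """Heuristic jagged partition: at most p row stripes, each cut into at
    most q column intervals. Returns (rectangles, maximum rectangle load)."""
    ps = prefix_sums(load)
    rows, cols = len(load), len(load[0])
    row_sums = [rect_load(ps, i, i + 1, 0, cols) for i in range(rows)]
    rects = []
    for r0, r1 in partition_1d(row_sums, p):
        col_sums = [rect_load(ps, r0, r1, j, j + 1) for j in range(cols)]
        rects += [(r0, r1, c0, c1) for c0, c1 in partition_1d(col_sums, q)]
    return rects, max(rect_load(ps, *r) for r in rects)


# Illustrative 4x4 workload matrix (made-up values).
sample = [[1, 2, 3, 4],
          [4, 3, 2, 1],
          [1, 1, 1, 1],
          [5, 5, 5, 5]]
rects, bottleneck = jagged_partition(sample, 2, 2)
print(rects, bottleneck)
```

The prefix-sum table lets any rectangle's load be read in constant time, which is why it is a common building block when many candidate partitions have to be compared.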

Related resources

Load Balancing Unstructured Adaptive Grids for CFD Problems

Mesh adaption is a powerful tool for efficient unstructured-grid computations but causes load imbalance among processors on a parallel machine. A dynamic load balancing method is presented that balances the workload across all processors with a global view. After each parallel tetrahedral mesh adaption, the method first determines if the new mesh is sufficiently unbalanced to warrant a repartit...

Experiences with Mesh-like computations using Prediction Binary Trees

In this paper we aim at exploiting the temporal coherence among successive phases of a computation, in order to implement a load-balancing technique in mesh-like computations to be mapped on a cluster of processors. A key concept, on which the load balancing schema is built, is the use of a Predictor component that is in charge of providing an estimation of the unbalancing between successive...

Efficient Support for Matrix Computations on Heterogeneous Multi-core and Multi-GPU Architectures

We present a new methodology for utilizing all CPU cores and all GPUs on a heterogeneous multicore and multi-GPU system to support matrix computations efficiently. Our approach is able to achieve four objectives: a high degree of parallelism, minimized synchronization, minimized communication, and load balancing. Our main idea is to treat the heterogeneous system as a distributed-memory machine...

Perspectives on asynchronous computations for fluid flow problems

Instances where asynchronous parallel computations can be used for the solution of fluid flow problems are discussed. The use of asynchronous portions of solvers can effectively counteract the consequences of the inefficiencies associated with high latency in the interprocessor (or intercomputer) communication. This is especially so in the case of irregular regions, irregular partitions, or in ...

Task-based Parallel Computation of the Density Matrix in Quantum-based Molecular Dynamics using Graph Partitioning

Quantum molecular dynamics (QMD) simulations are highly accurate, but they are computationally expensive due to the calculation of the ground-state electronic density matrix P via an O(N^3) diagonalization. Second-order spectral projection (SP2) is an efficient O(N) alternative to obtain P from a Hamiltonian matrix H. This poster presents a data-parallel version of the SP2 algorithm that uses an u...

Journal:
  • J. Parallel Distrib. Comput.

Volume 72  Issue

Pages  -

Publication date 2012